Modeling Query Term Dependencies in Information Retrieval with Markov Random Fields
نویسندگان
چکیده
This paper develops a general, formal framework for modeling term dependencies via Markov random fields. The model allows for arbitrary text features to be incorporated as evidence. In particular, we make use of features based on occurrences of single terms, ordered phrases, and unordered phrases. We explore full independence, sequential dependence, and full dependence variants of the model. A novel approach is developed to train the model by directly maximizing mean average precision. Our results show that significant improvements are possible by modeling dependencies, especially on larger web collections.
منابع مشابه
Beyond Bags of Words: Modeling Implicit User Preferences in Information Retrieval
This paper reports on recent work in the field of information retrieval that attempts to go beyond the overly simplified approach of representing documents and queries as bags of words. Simple models make it difficult to accurately model a user’s information need. The model presented in the paper is based on Markov random fields and allows almost arbitrary features to be encoded. This provides ...
متن کاملSource Code Retrieval from Large Software Libraries for Automatic Bug Localization
Sisman, Bunyamin Ph.D., Purdue University, December 2013. Source Code Retrieval from Large Software Libraries for Automatic Bug Localization. Major Professor: Avinash C. Kak. This dissertation advances the state-of-the-art in information retrieval (IR) based approaches to automatic bug localization in software. In an IR-based approach, one first creates a search engine using a probabilistic or ...
متن کاملAn Information Retrieval Expansion Model Based on Quasi-Clique
Query expansion is an important technology for improving retrieval performance in information retrieval. Many Studies have found contexts within query that strongly influence the interpretation of a query. In this paper, we propose the graph mining technique called Quasi-Clique as query context in Markov network retrieval model. Our approach exploits contextual information mined from the term M...
متن کاملImproving Query Expansion for Information Retrieval Using Wikipedia
Query expansion (QE) is one of the key technologies to improve retrieval efficiency. Many studies on query expansion with relationships from single local corpus suffer from two problems resulting in low retrieval performance: term relationships are limited and unlisted query terms have no expansion terms. To address these problems, relationships between terms captured from Wikipedia are superim...
متن کاملIncorporating Semantic Knowledge with MRF Term Dependency Model in Medical Document Retrieval
Term dependency models are generally better than bag-ofword models, because complete concepts are often represented by multiple terms. However, without semantic knowledge, such models may introduce many false dependencies among terms, especially when the document collection is small and homogeneous(e.g. newswire documents, medical documents). The main contribution of this work is to incorporate...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005